Efficient Bounds for the Softmax Function and Applications to Approximate Inference in Hybrid models
نویسنده
چکیده
The softmax link is used in many probabilistic model dealing with both discrete and continuous data. However, efficient Bayesian inference for this type of model is still an open problem due to the lack of efficient upper bound for the sum of exponentials. We propose three different bounds for this function and study their approximation properties. We give a direct application to the Bayesian treatment of multiclass logistic regression and discuss its generalization to deterministic approximate inference in hybrid probabilistic graphical models. The softmax function is the extension of the sigmoid function for more than two values. Its role is of central importance in many non-linear probabilistic models. In particular, many well-known models deal with discrete and continuous data. Variational approximations based on the minimization of the Kullback-Leibler divergence are one of the most popular tools in large-scale Bayesian inference. In recent years, generic tools such as VIBES [1] have been proposed for inference and learning of graphical models using mean field approximations. For graphs having discrete nodes with continuous parents, the direct mean field cannot be applied, since there is no conjugate family for the multinomial logistic model. Local variational approximations have been proposed in the case of binary variables [2]. They are based on a quadratic lower bound for the log of the sigmoid function. This is used in directed probabilistic graphical models having binary variables with continuous parents [3, 1]. However, for more than 2 categories, the inference problem remains unsolved. In section 2, give an detailed overview of the techniques used by different authors to deal with Bayesian inference for models involving softmax links. In section 3, we derive three simple lower bounds and discuss their approximation properties. In Section 4 we perform numerical experiments on a typical application which is the variational approximation of the multiclass logistic regression. Finally, we discuss its application to more general deterministic approximate inference algorithms in graphical models.
منابع مشابه
The use of wavelet - artificial neural network and adaptive neuro fuzzy inference system models to predict monthly precipitation
Precipitation forecasting due to its random nature in space and time always faced with many problems and this uncertainty reduces the validity of the forecasting model. Nowadays nonlinear networks as intelligent systems to predict such complex phenomena are widely used. One of the methods that have been considered in recent years in the fields of hydrology is use of wavelet transform as a moder...
متن کاملDetermining the Optimal Value Bounds of the Objective Function in Interval Quadratic Programming Problem with Unrestricted Variables in Sign
In the most real-world applications, the parameters of the problem are not well understood. This is caused the problem data to be uncertain and indicated with intervals. Interval mathematical models include interval linear programming and interval nonlinear programming problems.A model of interval nonlinear programming problems for decision making based on uncertainty is interval quadratic prog...
متن کاملADAPTIVE NEURO-FUZZY INFERENCE SYSTEM OPTIMIZATION USING PSO FOR PREDICTING SEDIMENT TRANSPORT IN SEWERS
The flow in sewers is a complete three phase flow (air, water and sediment). The mechanism of sediment transport in sewers is very important. In other words, the passing flow must able to wash deposited sediments and the design should be done in an economic and optimized way. In this study, the sediment transport process in sewers is simulated using a hybrid model. In other words, using the Ada...
متن کاملAccurate Inference for the Mean of the Poisson-Exponential Distribution
Although the random sum distribution has been well-studied in probability theory, inference for the mean of such distribution is very limited in the literature. In this paper, two approaches are proposed to obtain inference for the mean of the Poisson-Exponential distribution. Both proposed approaches require the log-likelihood function of the Poisson-Exponential distribution, but the exact for...
متن کاملInference for the Type-II Generalized Logistic Distribution with Progressive Hybrid Censoring
This article presents the analysis of the Type-II hybrid progressively censored data when the lifetime distributions of the items follow Type-II generalized logistic distribution. Maximum likelihood estimators (MLEs) are investigated for estimating the location and scale parameters. It is observed that the MLEs can not be obtained in explicit forms. We provide the approximate maximum likelihood...
متن کامل